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Abstract — We study the delay minimization in a direct mul- 
ticast communication scheme where a base station wishes to 
transmit a set of original packets to a group of clients. Each 
of the clients already has in its cache a subset of the original 
packets, and requests for all the remaining packets. The base 
station communicates directly with the clients by broadcasting 
information to them. Assume that bandwidths vary between the 
station and different clients. We propose a method to minimize 
the total delay required for the base station to satisfy requests 
from all clients. 

I. Introduction 

We study the issue of delay minimization of the so-called 
Direct Multicast with Side Information (DMSI) problem. In 
an instance of this problem, a base station wishes to transmit 
a set of n original packets to a group of k clients. Each of 
the clients already has in its cache a subset of the original 
packets (referred to as side information), and requests for all 
the remaining packets. The base station communicates directly 
with the clients by broadcasting information to them. 
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Fig. 1: An example of Direct Multicast with Side Information 

Such a scenario is usually observed in opportunistic wireless 
networks (TJ, ||2), where wireless nodes often opportunistically 
overhear packets that are not designated to them. These 
overheard packets become the side information for the nodes . 
This problem also arises in communication schemes where a 



server has to broadcast a set of packets to a group of clients. 
Limited storage capacity, bad reception, or signal degradation 
might lead to packet loss at the clients. Using a slow feedback 
channel, the clients inform the server about their missing 
packets, and request for retransmissions [|3]. 

In our model, each packet that is transmitted from the 
base station, referred to as a broadcast packet, is a linear 
combination of the original packets. Assume that bandwidths 
vary between the base station and different clients, and that 
each broadcast packet is designated for (in other words, 
assigned to) a subgroup of clients. The delay of a broadcast 
packet is defined to be the amount of time that a client (to 
which the packet is assigned) with a minimum bandwidth can 
receive the packet successfully. Our main contribution is to 
provide a method to minimize the total delay required for the 
base station to satisfy requests from all clients. We design an 
optimal packet assignment so as to achieve the minimum total 
delay. Moreover, the multicast scheme with optimal total delay 
can be found in polynomial time in n and k. 
A motivational example. Suppose that there are four clients 
Ci, C2, C3, C4, which miss 2, 1, 3, 5 original packets, 
respectively, as given in Fig Q] By a well-known result in 
network coding (see Section [III] for more details), provided 
that 

• the base station broadcasts at least 5 packets to the clients, 
and 

• the number of broadcast packets designated for each 
client is as many as the number of its missing packets, 

then there is a coding scheme for the base station to satisfy 
demands from all clients simultaneously. Assume that each 
broadcast packet is of size 8KB and that the bandwidth and 
the packet delay from each client are given the table in Fig. [2] 
Note that the delay is obtained by dividing the packet size 
by the bandwidth between the con-esponding client and the 
base station. Suppose that the base station uses five broadcast 
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Fig. 2: Bandwidths and delays for clients 

packets p%,. . ., P5. Consider the Packet Assignment A, given 
in Fig. [3] and the Packet Assignment B, given in Fig. [4] The 
total delay of the Packet Assignment B (20 seconds) is 4 



seconds less than the total delay of the Packet Assignment 
A (24 seconds). In fact, in Section [IV] we can see that Packet 
Assignment B is actually optimal in terms of the total delay for 
this scenario. The intuition is that the total delay gets smaller 
if fewer broadcast packets are assigned to more clients with 
large delays. This is proved later to be true. 
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Fig. 3: Packet Assignment A. A 1-entry means the broadcast 
packet in that row is assigned to the client in that column. 
The delay of a broadcast packet is the maximum delay from 
all clients to which the packet is assigned. 
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Fig. 4: Packet Assignment B 



Related work. The DMSI problem is a special case of the 
Multicast with Side Information (MSI) problem 0]. In an MSI 
instance, there is a network between the base station and the 
clients. Our problem considers the scenario where the only 
communication links are those between the base station and 
the clients. However, the issue of delay minimization is not 
investigated in |4j. 

Lun et al. study the problem of cost minimization 
for a general multicast network. In their setting, each vector 
of rates z at which packets are injected into edges of the 
network corresponds to a cost f(z). The goal is to find z 
that minimizes f(z). The main difference between our result 
and the result in [5| is the following. The authors in 
investigate asymptotic solutions with infinite block length 
codes; in other words, they consider divisible packets with 
infinitely many subpackets (the non-integral setting). In this 
work, we are only interested in network codes of block length 
one; in other words, we only consider indivisible packets (the 
integral setting). From a practical point of view, solutions to 
the integral setting are often preferred due to its simplicity in 
implementation, lower complexity in computation, and smaller 
buffer required at clients. In general, the integral setting might 
be harder to tackle than the non-integral setting (for instance 
linear programming can be solved in polynomial time, whereas 
integer linear programming is NP-hard). However, in our case, 
because of the special objective function (the total delay), 
the optimal solution for the integral setting can be found in 
polynomial time. 



Organization. We formulate our problem rigorously in Sec- 
tion lU A necessary and sufficient condition for the feasibility 
of a multicast scheme is provided in Section [Til] (Lemma [2]). 
In Section [IV] we construct a feasible multicast scheme and 
prove that it has minimum total delay (Lemma[3| Theorem|4]i. 

II. Problem Definition 

A Direct Multicast with Side Information (DMSI) instance 
is described as follows. A base station S has a set of n original 
packets X — {xx, . . . , x n }, where Xi G ¥ q , i 6 [n]. There are 
k clients C\, . . . , Ck- For each j G [k], the client Cj possesses 
a subset of original packets Hj CI as side information, and 
demands all missing packets in X \ Hj. We abbreviate such 
a DMSI instance by M = (ri, {Hj}\). 

A multicast scheme for the instance M = (n, {-Hj}i) is a 
2-tuple (P, A) where 

• P = {pi, . . . ,p m } is a set of broadcast packets, i.e. linear 
combinations of the original packets, that the base station 
broadcasts to the clients, 

• A = (aij) is an m x k binary matrix, where <Zjj = 1 
if and only if the broadcast packet pi is assigned to the 
client Cj, for i G [m], j £ [k]. 

We refer to A as the (packet) assignment matrix. The assign- 
ment matrix determines which clients a broadcast packet is 
assigned to. A multicast scheme is feasible if upon receiving 
all designated broadcast packets, each client can retrieve all 
missing original packets. 

We assume that the client Cj (j G [k]) requires dj seconds 
to receipt a broadcast packet (assigned to it) successfully. We 
refer to dj as the delay from Cj (j G [k]). Furthermore, 
suppose that after broadcasting a packet pi G P, the base 
station can start sending another packet only when all clients 
that pi is designated for already receive pi successfully. We 
define the delay of the packet pi according to the assignment 
matrix A by 

cU(Pi) = max{(i, : <Zjj = 1}. (1) 

Note that the base station must transmit pi at the minimum 
rate among all designated clients so that the client with 
the smallest bandwidth can manage to decode the packet. 
Therefore, the largest delay among the designated clients is the 
bottleneck and dominates the delay for that broadcast packet 
transmission. Therefore, d^ipi) is the amount of time required 
for the broadcast packet pi to be successfully received by all 
designated clients. We define the total delay of a multicast 
scheme (P,A) by 

m 

d^PA) = ]Td A ( K )- (2) 

i=i 

Notice that the total delay can be determined solely from 
the assignment matrix. Therefore, sometimes we use dy>(A) 
instead of dj^(P, A). Our goal is to find a feasible multicast 
scheme with minimum total delay, for a given DMSI instance. 

As an illustrative example, we consider the DMSI instance 
as described in Fig. Q] In this example, n — 6, k — 4, and the 
side information at the clients are given below. 
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Hi = {X1,X 3> X 5 ,X 6 }, H 2 = {X!,X2,X 3 ,X4,X 5 }, 

H 3 = {x 3 ,X4,x 6 }, Hi = {x 4 }. 

The Packet Assignment A and B in Fig. [3] and [4] can be 
incorporated into multicast schemes (P, A) and (P*,A*), 
respectively, where the assignment matrices are given in Fig. [5] 
In Section IIVI we show how to determine the packets in P 
and P* so that (P, A) and (P*,A*) are feasible multicast 
schemes for Ai. Regarding the packet delay, let us examine 
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Fig. 5: The assignment matrices A and A* 



the third broadcast packet p 3 , which is designated for C 3 
and C4, according to A*. The delay from these two clients 
are 2 = 8/4 seconds and 1 = 8/8 seconds, respectively. 
Therefore, 

dA*(p 3 )=max{2,l} = 2. 

The total delay of the matrix A* is calculated as follows. 

5 

d E (A*) = £d A .( ft ) 

i=l 

= max{8, 4, 2, 1} + max{8, 2, 1} + max{2, 1} 

+ max{ 1 } + max{ 1 } 
= 8 + 8 + 2 + 1 + 1 = 20. 

III. Feasibility of a Multicast Scheme via 
Network Coding 

In this section, we establish a necessary and sufficient 
condition for the feasibility of a multicast scheme for DMSI 
via network coding. 

Hereafter, let Ai — (n, {Hj}\) be a DMSI instance. For an 
m x k binary matrix A, we define the network Af(A4, A) as 
follows. The set of nodes of Af(Ai, A) consists of 

> one source node s, which possesses all original packets 
X\ , . . . , x n , 

• n "original packet" nodes si, . . . , s n , each corresponds 
to an original packet, 

> m intermediate nodes 1*1, . . . , u m , 

• m "broadcast packet" nodes V\, . . . , v m , 

• k sinks ti,t2, . ■ . , tfc, each corresponds to a client and 
demands all original packets. 

The set of (directed) edges of Af(Ai, A) consists of 

• (s,Si) with capacity one for all i G [n], 

• (si,tj) with capacity infinity if and only if xi G £Zj, 

• (si,Uh) with capacity infinity for all i e [n], h G [m], 

• (u/i, u/i) with capacity one for every h G [m], 

• (u/j, ij) with capacity one if and only if a^.j = 1. 

As an illustrative example, the network J\f(A4, A*), where Ai 
is given in Fig. [T] and A* is given in Fig. |5bJ is depicted in 
Fig. |6]and Fig. HI 
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Fig. 6: The network Af(A4, A*) with given in Fig. [TJand 
A* given in Fig. [5b] The (side information) edges from Sj to 
tj are depicted in a separate figure (Fig. |7) for a clearer view 




Fig. 7: The side information edges of the network Af(A4, A*) 
in Fig. |6] 

The network N(A4 : A) is called solvable if the source s is 
able to multicast n packets to all k sinks simultaneously by 
using a linear coding scheme (see [6|). 

Lemma 1. Suppose that A is an m x k binary matrix. Then 
there exists a feasible multicast scheme (P, A) with \P\ = m 
for A4 if and only if the network Af(A4, A) is solvable. 

Proof: Assume that there exists a feasible multicast 
scheme (P, A) with \P\ — m for M.. Let P = {pi, . . . ,p m }- 
Then s can multicast n packets xi,...,x n to all sinks in 
Af(M, A) simultaneously using the following coding scheme: 

• s sends Xi to si for every i e [n], 

• Si (i G [n]) sends Xi to tj (j G [k]) if they are adjacent, 

• Si sends Xi to ut for every i £ [n] and ft, e [m], 
> lift sends to for every h £ [m], 

• Vh (h G [m]) sends to ^- (j £ [fc]) if they are adjacent. 
Conversely, assume that the network Af (Ai, A) is solvable. By 
definition, there is a coding scheme so that s can multicast n 
packets to all sinks simultaneously. By applying an invertible 
linear transformation if necessary, we can suppose that s 
sends Xi to Si for every i e [n\. For each h G [m], let 
Ph be the packet transmitted on the edge (uh,Vh). Then it 
is straightforward that (P, A) where P = {pi, . . . ,p m } is a 
feasible multicast scheme for Ai. ■ 

For the instance Ai = (n, {Hj}i), for each j G [fc] let = 
n — \Hj \ denote the number of missing original packets of the 
client Cj. Let wt(A[7']) denotes the number of 1-entries in the 
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jth column of A. A necessary and sufficient condition for the 
feasibility of a multicast scheme is presented in the following 
lemma. We show that it is possible to satisfy demands from 
all clients if and only if each client receives as many broadcast 
packets as its missing original packets. 

Lemma 2. Suppose that A is an m x k binary matrix. Then 
there exists a feasible multicast scheme (P, A) with \P\ = m 
for M. if and only ifwt(A[j]) > Wj for every j G [k\. 

Proof: The condition that wt(A[j}) > Wj for every 
j G [k] is equivalent to the condition that every cut between 
the source s and a sink in j\f(A4 , A) has capacity at least n. 
Due to lack of space, we provide a separate proof for this 
statement in Q. By the well-known result from multicast 
network coding j6], the latter is a necessary and sufficient 
condition for the solvability of j\f(A4, A). By Lemma [TJ we 
finish the proof. ■ 

Lemma [2] implies that if (P, A) is a feasible multicast 
scheme for B then \P\ > m&XjWj. In Section IIV-AI we 
construct a feasible multicast scheme that employs precisely 
max.,- Wj broadcast packets. 

IV. Optimal Packet Assignment 

In this section, we first describe a feasible multicast scheme 
(P*,A*) for a DMSI instance M = {n,{Hj}\), and then 
show that this scheme obtains the minimum total delay among 
all feasible multicast schemes for A4. 

A. The Multicast Scheme (P*, A*) 

Relabeling the clients if necessary, we assume that 

di > d 2 > ••• > d k . (3) 
We consider the multicast scheme (P*,A*), where 

to* = IP* I = maxu> 



je[k] 

and A* = (a* ) defined as follows 



Uj, 



1,3 



1, if 1 < i < Wj, 
0, if Wj < i < to* 



(4) 



(5) 



We already see an example of such an assignment matrix A* 
in Fig. [5b] where M. is given in Fig. [TJ 

The broadcast packets of P* can be obtained as follows. By 
dU and ©, we have wt(A[?]) = Wj for every j G [ft]. There- 
fore, by Lemma[2] there exist broadcast packets p%, . . . ,p m so 
that (P*,A*) with P* = {p%, . . . ,p m *} is feasible. Moreover, 
by the proof of LemmaQ] these broadcast packets can be found 
in polynomial time in n and k, using the algorithm in [8 1, given 
that q > k. For example, let q — 4 and F4 = {0,1, a, a 2 }. 
For A* given in Fig. [5b] the broadcast packets of P* can be 
chosen as follows: p\ = ax^+x^+^x^+axg, P2 = X1+X2 + 
^x^+ax^+x^+XQ, P3 = axi+a 2 X2+X3+ax4+xc ) +a 2 Xe, 
Pi = xi+ct 2 X3+ctXi+ct 2 XQ, = a 2 xi+ax2+X3+ax4+X5. 



B. The Optimality of (P*,A*) 

Now we prove the optimality of (P*,A*) in terms of 
the total delay. Let (P, A) be an arbitrary feasible multicast 
scheme for j\A. Our goal is to show that d^(^4) > d^(^4*). 

By Lemma [2] the feasibility of (P, A) implies that 
wt(A[j]) > Wj for every j G [k]. Since flipping a 1-entry 
into a 0-entry does not increase dy^(A), we may assume that 
wt(A[j]) = Wj for every j G [k]. In Lemma [3] we show that 
the total delay of (P, A) is not smaller than that of (P*,A*). 
First, we illustrate the idea of Lemma [3] via an example. 

Consider the DMSI instance M. given in Fig. [T] together 
with the delays from the clients given in Fig. [2] Let A and 
A* be the assignment matrices given in Fig. [5j We now show 
that dy;(A) > dj^(A*) using an algorithmic approach. We 
modify A through several steps so that finally, A is turned 
into A*. Moreover, in every step, dy^(A) is never increased. 
Step 1. We permute the second and the third row of A. 
Obviously, d^r(A) remains unchanged. The matrix now is 
given in Fig. [8a] We can see that the first columns of A and 
A* are now the same. 

Step 2. We shift the only 1-entry in the second column of 
A all the way up to the first row, by swapping a\ t 2 and 
03,2- The matrix now is given in Fig. [8b] As d% > d^, the 
broadcast packet p\, which corresponds to the first row of 
A, still remains to be d\ after the aforementioned swap. As 
03 2 is now zero, the delay of the third packet is decreased to 
di < c?2- These are the only changes in the total delay of A 
after this step. Therefore, dy^(A) is not increased (in fact, it 
is decreased by 3 seconds). Now the first two rows of A and 
A* are the same. 

Step 3. We first swap 02.3 and 04,3. The delay of the second 
broadcast packet is still di after the swap. The delay of the 
forth broadcast packet, from d$, is now decreased to < d%. 
Next, we swap the third and the fifth row of A. The total delay 
of A is unchanged. The matrix now is given in Fig. [8c] The 
first three rows of A and A* are the same. Their forth rows 
are also identical. 
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(a) A after Step 1 

Fig. 8: 



(b) A after Step 2 

A is turned into A* 



(c) A after Step 3 

in three steps 



Lemma 3. Let A be an mxk binary matrix where wt( A[j]) = 
Wj for every j G [k]. Then dj^(A) > d^(A*). 

Proof: Since wt(A[j]) = Wj for all j G [k], we have 

to > max w., = m*. 
je[fc] 

The idea is to repeatedly modify the matrix A through k + 1 
steps, so that at each step, the total delay of A is not increased. 
At the final step, A is turned into A*. As the total delay 
never goes up during the whole process, we conclude that 
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Hereafter we say that the two column vectors u e F™ and 
v G F™ are almost identical if their first m* coordinates are 
identical and the last m — m* coordinates of u are all zeros. 
Step 1. As wt(A) = wt(A*) = W\, we can permute the rows 
of A (if necessary) so that the first columns of A and A* are 
almost identical. As permuting rows does not affect the total 
delay, after Step 1, the total delay of A remains the same. 
Step j (2 < j < k). Suppose that up to Step j — 1, the first 
j — 1 columns of A and A* are almost identical. In this step, 
we modify A so that the jth columns of A and A* become 
almost identical. Intuitively, we shift all of the 1-entries in the 
jth column of A upward as much as we can, and prove that 
during the process, the total delay of A is not increased. Let 

U(j) = {ie [to] : 3f < j s.t. a itj > = 1}, L(j) = [m]\U(j). 

In words, U(j) denotes the set of upper rows of A, each of 
these contains at least a 1-entry that is located within the first 
j — 1 columns. Note that U(j) consists of the first |t/(j)| 
rows of A. In opposite, L(j) denotes the set of remaining 
lower rows of A, where all entries in these rows that are 
located within the first j — 1 columns are zeros. The following 
modifications to A do not increase d^(A) and at the same 
time, keep the first j — 1 columns of A unchanged. 
(Ml) Modify the entries in the jth column that are located 
within the first |C/(j)| rows. As the first j — 1 columns 
of A and A* are almost identical, the delays (w.r.t A) 
of the first |?7(j)| broadcast packets are from the set 
{di, . . . , rfj-i}. Since dj < dji for all j' < j, any 
change in the jth column within the first |<7(j)| rows 
does not affect the delays of the corresponding packets. 
(M2) Turn a 1-entry in the jth column that are located within 
the last \L(j)\ rows into a 0-entry. The delay of the 
corresponding broadcast packet is changed from dj to 
dj" for some j" > j. As dj>> < dj, the packet delay is 
not increased. 

(M3) Permute rows in L(j). It is obvious that permuting rows 
in A does not affect dyj(A). Moreover, by definition of 
U(j) and L(j), permuting rows within L(j) does not 
affect the first j — 1 columns of A. 
With (Ml), (M2), and (M3) in mind, we now apply some 
modifications to A. Within the first |<7(j)| rows, in the j 
column of A, we swap pairs of 0- and 1-entries such that 
the 0-entries are below all the 1-entries. Due to (Ml), d^(A) 
remains unchanged. We next consider two cases. 
(CI) The jth column of A has no 1-entries in the last \L(j)\ 
rows. Then we are done for Step j since now the jth 
column of A is already almost identical to that of A*. 
(C2) The jth column of A has some 1-entries in the last 
\L{j)\ rows. We now examine only the entries in the 
jth column of A. 
a) If there are as many 0-entries in the upper part 
U(j) as 1-entries in the lower part L(j) then we 
can shift the 1-entries all the way up by applying 
appropriate entry swaps; and doing so makes the 
jth columns of A and A* almost identical. By 
(Ml) and (M2), d^(A) is not increased. 



b) If there are fewer 0-entries in the upper part U (j) 
than 1-entries in the lower part L(j), we first shift 
as many as we can the 1-entries from L(j) to U(j); 
then all entries in U(j) are one. By (Ml) and (M2), 
d^2(A) is not increased. Finally, we permute rows 
in L(j) so that in the jth column of A, the 1-entries 
lie above all the 0-entries. Then the jth columns 
of A and A* are almost identical. Moreover, by 
(M3), d^p(A) is unchanged. 
Step k + 1. The previous k steps guarantee that d^(A) is not 
increased and all k columns of A are almost identical to that 
of A*. Therefore, the last m* —m rows of A are all-zeros. In 
this step, we remove the last m* — m rows of A, to turn A 
into A* . Certainly, d^(A) remains unchanged in this step. ■ 

Theorem 4. The multicast scheme {P*,A*) obtains the 
minimum total delay among all feasible multicast schemes for 
M = (n, {Hj}i). Moreover, 

k 

dj-~(P*,A*) = ^ dj x max {0, Wj — max{u)j} <j/ <: , } , (6) 

where we set wq = 0. 

Proof: The first assertion follows by Lemma [3] To prove 
that (|6]l holds, we show that there are 

max {0, Wj — maxjiUj-lcKj'^ } 
broadcast packets that have delay dj . Obviously, the first 

w\ = max{0, wi — max{wo}} 
broadcast packets have delay di, due to (fTJ and ([3J. By the 
definition of (P* , A*), for each j > 1, there are precisely 

max {0, Wj — max{u>j-}o<j'<j } 
broadcast packets that are assigned to Cj but to none of the 
clients Cy with j' < j. Due to (Q~|) and (0), these are the only 
broadcast packets that have delay dj . ■ 
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